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Tiered Web Site with Web-Tier Admission Control Based 
on Utilization Data Generated on Application Tier 

[00011 BACKGROUND OF THE INVENTION 

5 [0002] The present invention relates to computers and, more 
particularly, to "host" computer networks that respond to requests 
from external "chent" computers, A major objective of the present 
invention is to enhance the quahty of service associated with a host 
site on the World Wide Web. 

1 0 [0003] Much of modem progress is associated with the 
development of computers and the Intemet, which permits 
computers to communicate worldwide. Several protocols exist by 
which a host site, which may comprise one or more "server" 
computers, receives and processes messages from a number of 

1 5 other computers ("chents"). For example, HTTP (HyperText Transfer 

Protocol) is a communications protocol used on the World Wide Web 
to enable users to navigate within and among host sites- 

[00041 Messages can usually be grouped iato sessions, with 
each session having one or more related messages. For example, a 
20 session can consist of a message requesting information over the 
World Wide Web, and an associated response. Alternatively, a 
multiple-message session can consist of a commercial transaction, 
with related messages respectively used to locate within a web site a 
precise product, submit an order or billing and shipping 

2 5 information, and convey a confirmation of sale to a particular chent. 

Whether a host is to process just a siQgle message or a series of 
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related messages, it is usually important to quickly, accurately, and 
completely service each message and each session. 

[0005] The term "quality of service" refers to a host's abihty to 
respond quickly to individual messages and to complete sessions. 
5 As a particular host becomes more popular, and therefore receives 
more messages, the host's processing resources can become 
stretched. For example, due to heavy traffic, a host may not be able 
to respond to a message at all, or the host may not provide a timely 
response (which can cause a chent to "time-out" with an error or the 
1 0 impatient user to resend the message). Poor quahty of service can 
have significant impact, as users may become frustrated and simply 
give up trying to reach a particular host, and the sponsor of the host 
may lose sales or fail to communicate needed information to some 
cUents- 

1 5 [0006] Quahty of service can be improved by adding 
processing capacity and by implementing admissions control- 
Different approaches to increasing capacity are discussed further 
below. In many cases, it is cost effective to provide sufficient 
capacity to handle all messages most of the time, while relying on 

20 admissions control for peak demand situations. Even where 
additional hardware resources can be added on demand, peak usage 
can occur too suddenly for the additional capacity to be operational; 
the time from the spike being identified to the time additional 
capacity is available can be hours to days for manual operations and 

2 5 many minutes to hours for automatic or semi-automatic operations. 
During this time, admission control is often the best alternative to 
handling an immediate spike in usage, especially when the spike is 
short hved and far higher than the site's demand growth trend. 
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[0007] Admissions control involves selectively admitting and 
rejecting messages so that the messages that are processed are 
handled promptly and rehably. InteUigent admissions control can 
prioritize messages to advance the purposes of the host site. For 
5 example, messages associated v^th ongoing sessions can be given 
priority- U.S. Patent 6,006,269 to Phaal discloses a system in which 
admission determinations are made on a session basis and in which 
non-admitted messages are deferred with a higher priority level 
rather than rejected outright- Thus, opportunities associated with 
1 0 non-admitted messages are not necessarily lost. 

[00081 Preferably, the admissions decisions are based on 
utilization data gathered by the host site. All messages can be 
admitted when resource utiUzation is low. As it increases to 
potentially problematic levels, admissions can become increasing 

1 5 selective. However, since there is a potential detriment to the site 

whenever admissions control rejects or defers a message, admission 
control is not desirable on a steady-state basis. It is preferable that 
host-site capacity be able to process all messages most of the time 

[0009] In general, quahty of service can be improved by adding 
20 processing power to the host site. For example, in a single-server 
site, a less powerful server can be replaced with a more powerful 
server. An advantage of the single-server replacement approach is 
simphcity. A single computer handles all site functions. Programs 
running on that computer can handle message, monitor resources, 

2 5 and administer admissions control. 

[0010] On the other hand, there are limits to the single-server 
approach. At the high end, where the hmits of available technology 
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are pushed, fractional increases in power are quite costly. 
Furthermore, when the single server is down, the entire site is down. 
Also, a replacement strategy can be wasteful if the replaced server is 
no longer used. In practice, most large sites use multiple servers. 

5 [0011] Similarly configured severs can be arranged in parallel. 
Servers can be added in parallel as needed to increase capacity. 
A load-balancing mechanism can be added at the front end to 
distribute messages among the parallel servers. However, the 
parallel servers need to be coordinated to ensure session integrity, 

1 0 Also, while admissions control can be done independently by each 
server, this results in messages being rejected by one server while 
there is ample capacity on another. It is not practical for the 
parallel servers to commimicate with each other regarding each 
cUent request- While the coordination can, in principle, be 

1 5 performed at the load balancer, imposing additional processing 
requirements on the common node for all messages can result in an 
unacceptable performance bottleneck. 

[00121 A tiered host site overcomes many of the problems 
facing a multi-server host site. A typical tiered host site has a 

20 cUent-response tier (e.g., web tier using HTTP protocol) and an 
appUcation tier. The application tier performs most of the resource- 
intensive work regarding the purpose of the site, while cUent- 
response tier typically provides the cUent interface. For example, 
the chent-response tier can provide a web interface for a chent, 

25 while the appUcation tier handles e-commerce (electronic 
commerce) applications and maintains a database accessed by the 
appUcations. The appUcation tier can manage sessions, tagging 
responses so that subsequent messages in the session can be 
identified as such. The cUent-response tier can use the session tags 
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to generate cookies and/or links associated with session-specific 
URLs to direct subsequent client requests to the proper appUcation 
server. 

[0013] Each tier can be expanded independently as required. 
5 Since the client-response tier manages the distribution of messages 
to the appUcation tier, apphcation tier servers can be differentiated 
by function. Capacity can be increased by activating (adding, 
turning on, or re-allocating) a server dedicated to the stressed 
function so that the additional resources are not wasted on 
1 0 underutilized appUcations. 

[0014] The cUent-response tier can utiUze parallel similarly 
configxu'ed servers. Coordination among the parallel servers is 
faciUtated by the session tags added by the appUcation tier- 
Expansion of the cUent-tier capacity is then readily achieved by 

1 5 adding additional servers in paraUel. Altematively, the cUent- 
response tier can be configured as a load-balancing hub supported 
by other servers dedicated to specific cUent-response functions, 
such as encryption and decryption. While the hub approach does 
place additional burdens on the common-node load balancer, 

20 session tracking is stiU managed primarily at the appUcation tier. A 
hubbed cUent-response tier can be expanded efficiently by adding 
servers dedicated to a specific function that is over utiUzed. 

[0015] In a tiered site, the admissions control function is 
typicaUy assigned to the cUent-response tier since it provides the 
25 front-end interface to the cUent computer. The admission control 
function can monitor local resource utilization effectively on a per- 
server basis. The admission control function on the cUent-response 
tier can monitor response times associated with requests to the 
appUcation tier as a measure of its resource utiUzation. The 
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resource utilization information about the client-response tier and 
the apphcation tier then is used to determine the admissions 
control poUcy at any given time. 

[0016] While the tiered host site approach provides for 
5 efficient scaling and for effective admissions control, there is an 
insatiable demand for better performance. In particular, there is a 
demand for better admission control, since it is a software 
component that can, in principle, be upgraded less expensively than 
the host site hardware. What is needed is more effective admission 
1 0 control for a tiered host site. 

[001 7] SUMMARY OF THE INVENTION 

[0018] The present invention provides a network host site in 
which admission control determinations are made on a chent- 
response tier using resource utilization data collected on an 

1 5 apphcation tier. The invention further provides for admissions- 
control determinations that are also made as a function of resource- 
utihzation data collected on the chent-response tier. In addition to 
being used for admissions determinations, the resource utOization 
data can be used to request an increase or decrease of active server 

20 resources. 

[0019] The host site has distinct chent-response and 
apphcation tiers in that each host-site server is assigned to at most 
one of the tiers. Servers that only handle chent interfacing are 
assigned to the chent-response tier, while servers that run non- 
25 interfacing apphcations but do not handle chent interfacing are 
assigned to the apphcation tier. Servers that handle both non- 
interfacing apphcations and chent interfacing are assigned to the 
chent-response tier. 
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[0020] The invention provides for a variety of tier 
configurations. The chent-response tier can be configured as a 
single server, as a set of parallel servers with a load-balancing front 
end, or as a set of speciaUzed servers managed by a routing hub. 
5 The apphcation tier can have sub-tiers; for example, speciahzed 
appUcation servers can be serviced by one or more database servers 
on a back end. Alternatively, the apphcation tier can be arranged 
aroimd a routing hub. 

[0021] While apphcation- tier performance can be assessed by 
10 the chent-response tier, local apphcation-tLer resource monitoring 
has several advantages. First, the apphcation server as a whole can 
be measured independently of the network and even other servers 
(such as the database). Also it allows for a disaggregated measure 
of an apphcation's performance by resource components (for 
1 5 example, memory, CPU, I/O). Moreover, it aUows each apphcation to 
associate measurements with its own admissions classes and base 
them on session or even chent identity. 

[0022] FinaUy, resource monitoring can be customized on a 
per-server basis. Where multiple apphcations are ruiming on the 

20 same server, customization can be implemented on a per- 
apphcation basis. This allows for better optimization of apphcation 
resources. For example, the apphcation tier can generate resource 
utihzation data in XML ("Extensible Markup Language) format with 
instructions to the chent-response tier regarding interpretation and 

25 usage of the apphcation-tier resource utihzation data. These and 
other features and advantages of the present invention are apparent 
from the description below with reference to the foUowing 
drawings. 
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[00231 BRIEF DESCRIPTION OF THE DRAWINGS 

[0024] FIGURE 1 is a schematic diagram of a host site with 
parallel cUent-response-tier servers and comiected to multiple 
cUents in accordance with the present invention. 

5 [00251 FIGURE 2 is a schematic diagram of the host site of FIG, 
1 indicating internal commmiications paths and scalabiUty options. 

[0026] FIGURE 3 is a flow chart of a method of the invention 
implemented on the host site of FIG. 1. 

[0027] An alternative embodiment is described but not 
1 0 illustrated. 

[0028] DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0029] In accordance with the present invention, a host site 
API commimicates with chent computers 11 over the Internet, 
indicated by a connection 13, as shown in FIG. 1. In this case, host 

1 5 site API is an e-commerce site, and more specifically a flight- 
scheduhng and reservation-handling site. More generally, the 
invention provides for host sites with a wide range of purposes- 
The host site is divisible into an appUcation tier 15 and a web 
tier 17. In FIG. 1, chent computers are depicted as part of a chent 

20 tier 19. This is a conceptuahzatLon from the perspective of the host 
site and does not imply any special relationship among the chent 
computers other than the abihty to communicate with host site API. 

[0030] The host-site hardware is shown in greater detail in 
FIG. 2- Web tier 17 has three active servers WSl. WS2, WS3, and a 
25 load balancer 21. In periods of high demand, host site API can 
activate up to three additional servers WS4, WS5, and WS6. 

8 
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Application tier 15 includes a database server ASl and two front- 
end application servers AS 2 and ASS, w^hile an additional 
application server AS4 can be activated when additional resources 
are required- Servers WS4, WS5, WS6, and AS4 are normally inactive 
5 or used elsewhere in order to save costs. The relatively greater 
expandabihty of the web tier (indicated by having three servers in 
reserve versus one for the apphcation tier) corresponds to the 
greater computing power and state maintenance normally required 
of the apphcation servers, and thus the greater expense involved in 
1 0 adding servers to the apphcation tier. 

[0031] Apphcation- tier servers, may, but need not, run the 
same software. For example, apphcation server ASl runs a database 
program that organizes information regarding flight schedules, 
Apphcation server AS 2 runs a flight scheduling apphcation, while 

1 5 apphcation server ASS runs a flight reservation apphcation. In 
times of high demand, apphcation server AS4 can provide additional 
capacity for the flight-reservation function. Moreover, each 
apphcation server can run multiple apphcations, some of which may 
run on only one apphcation server, some of which may run on all 

20 apphcation servers, and some of which may run on more than one 
but not aU apphcation servers. However, the apphcation servers do 
not run the web-serving apphcations that interface directly v^th 
chent tier 19. 

[0032] Web servers WSl, WS2, and WS3, as weU as the other 
25 web servers when active, run respective copies of the same 
software. Each runs a web serving apphcation that provides the 
main interface to browsers running on chent computers 11. Load 
balancer 21 distributes chent requests in round-robin fashion to the 
active web servers. In addition, each server can run one or more 
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application programs other than web servers. However, the more 
complex appUcation programs are run on the application-tier 
servers. 

[0033] The common functions of servers on web tier 17 are 
5 shown in FIG. 1. Ghent requests are received at an admission 
control gateway 23, Gateway 23 uutially determines whether or not 
the request is encrypted. If it is, it is decrypted at encrypt/decrypt 
function 25. Once the request is decrypted or if it was not 
encrypted, admission control gateway 23 determines whether to 
10 pass it on to (software) chent-request processor 27, or to deferral 
manager 29. The decision is made as a function of the 
prograiniriing of admission control gateway 23 by deferral 
manager 29. 

[0034] In the event the request is passed to it, chent-request 
15 processor 27 determines whether it can respond to the request 
using local (web-tier-server) resources, or whether it needs to 
forward the request to application tier 15. In the former case, a 
chent response can be generated ta accordance with the cUent 
browser protocol (e.g., HTTP). The chent response is passed to 
20 admissions control gateway 23. If encryption is required, 
admissions control gateway 23 passes the chent-response to 
encrypt/decrypt function 25 and forwards the encrypted result to 
the chent that made the request. 

[0035] If the chent request caimot be met locaUy, chent- 
25 request processor 27 formulates an apphcation request for an 
apphcation processor 31 in the form of software running on 
apphcation tier 15. Since not all servers of apphcation tier 15 run 
the same apphcations, chent-request processor 27 must direct the 
apphcation request to the appropriate server and apphcation. If 

10 
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that target application is running on more than one application-tier 
server, cUent-request processor 27 must determine which of those 
servers to direct the apphcation request to. Generally, requests 
associated v^th the same session are directed to the same 
5 apphcation server. Otherwise, a selection can be made based on 
unused capacity, or more simply on a round-robin basis. 

[0036] Some apphcation requests can require processing on 
more than one server. For example, a request for flight-schedule 
information may require a flight-schedule apphcation on server AS 2 

10 (FIG. 2) to access database server ASl. However many apphcation 
servers are involved, an apphcation response is generated at 
apphcation tier 15 in response to the apphcation request. The 
apphcation tier can tag the apphcation response with session 
identification information so that subsequent chent requests in the 

1 5 same session can be correlated. 

[0037] The apphcation response is forwarded from apphcation 
processor 31 to the chent-request processor 27. TypicaUy, the 
apphcation response is transmitted to the web-tier server that 
generated the corresponding apphcation request. Alternatively, the 
20 selection of web-tier server for the apphcation response can be 
selected on some other basis, e.g., roimd robin. Once chent-request 
processor 27 receives an apphcation response, it generates a chent 
response, treating it in the manner described above for locaUy 
answered chent requests. 

25 [0038] Chent-request processor 27 can use the session 
identification information to generate a coolde or a unique URL. 
This URL can be communicated in the form of a web page link that 
generates a new chent request with the session information 

1 1 
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embedded. This enables a web-tier server to route a subsequent 
message in a session to the appropriate appUcation-tier server. 

[0039] Each application- tier server has a resource monitor 33. 
The parameters 35 to be monitored can be selected, but typically 
5 CPU usage is one parameter that is monitored. The parameter 
values determined through this monitoring are forwarded from the 
resource monitor to the deferral manager 2729. The resource 
monitor 33 of each appUcation server forwards this data to every 
web-tier deferral manager 29. The deferral manager 29 uses this 
10 information to determine the response of gateway 23 to client 
requests. 

[0040] Concurrently, the resource parameters associated with 
the web tier can be monitored. To this end, a web-tier resource 
monitor 37 on each web-tier server monitors usage including cUent- 
1 5 request processing and encryption and decryption activity. Note, 
the web-tier resource utiUzation data for each web-tier server can be 
provided to load balancer 21 (FIG, 2) so that it can take web server 
UtiUzation data into account when distributing chent requests. 

[0041] Depending on the programming, deferral manager 29 
20 can command gateway 23, in the alternative to admit all requests, 
admit no requests, admit only requests associated with on-going 
sessions, or admit requests according to some more complex 
criteria- Non-admitted requests can be forwarded to the deferral 
manager. The deferral manager can assign a time for a chent to 
25 resubmit the request. The deferral message is tagged so that a 
request in comphance with the deferral message is given 
preferential treatment at gateway 23. For example, resubmitted 
requests can be admitted while new requests are deferred- The 
deferral time can be set as a fimction of resource utihzation- In 

12 
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addition to controUiag admission criteria, the deferral manager can 
send a request to a resource manager on the apphcation tier to 
increase (or decrease) available resources. 

[0042] A method Ml of the invention practiced in the context 
5 of host site API is flovs^ charted in FIG. 3. Method Ml includes tv^o 
processes, a request-handling process PI and a resource-utihzation 
process P2- Request-handling process involves chent tier 19, and 
the two host-site tiers: web tier 17, and apphcation tier 15. 

[0043] An illustrative sequence for request process PI begins 
1 0 with a chent sending a request at step SI. Load balancer 21 (FIG. 2) 
receives this request and aUots it on a round robin basis, taking into 
account web-tier resource-utihzation data, at step S2 to a web-tier 
server, e.g., server WSl. The gateway for the receiving server can 
either admit or defer this request. In this illustrative sequence, the 
1 5 request is deferred at step S3. In this case, deferral manager 29 
generates and transmits to the cUent a deferral message indicating 
when the chent can resubmit the request. 

[0044] The chent resubmits the request by chcktng a hnk in the 
deferral message at step S4. Load balancer distributes this 2''*^ 
20 request to a web-tier server at a second iteration of step S2. In 
general, the deferred request is admitted at step S5. 

[0045] Unless the chent-request processor of the web-tier can 
respond on its own, it generates an apphcation request and 
transmits it to the apphcation tier at step S6. Each web-tier server 
25 knows which apphcations run on which apphcation- tier severs, so 
the request is directed to an appropriate apphcation-tier server. 
The receiving server generates an apphcation response at step S7. 
The receiving server may access other apphcation tier servers, 

13 
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including a database server, as necessary to formulate this 
application response. The apphcation tier can tag the appUcation 
response with session-identification information to keep track of 
sessions- Finally, the cUent-request processor generates a cUent 
5 response, using the session identification to generate a cookie or a 
unique URL that can be used in routing subsequent messages in a 
session. 

[0046] The chent response S8 may result lq a further chent 
request. Assimiing this is a continuation of a session, the session is 
1 0 identified in the new request. This session information is ignored 
by the load balancer, but used by the selected web-tier server to 
direct a new apphcation request to the appropriate apphcation tier 
server. 

[00471 Resource process P2 is concurrent with multiple 
1 5 iterations of steps S1-S8. At step Rl, apphcation tier resources are 
monitored at the apphcation tier. For example, each apphcation tier 
server can monitor its own resources. The monitor generates usage 
data at step R2. This data is transferred to the web tier and used to 
generate deferral criteria at step R3. The deferral criteria are used 
20 at the web tier to determine which requests are admitted (at step S5) 
and which steps are deferred (at step R3). 

[0048] In host site API, the web tier is arranged with servers 
operating in parallel. The invention provides for alternatives to the 
parallel web-tier arrangement. For example, the web tier can be 
25 implemented on a single server. Furthermore, a multi-server web 
tier can differentiate servers by function, for example, as in an 
alternative host site. 
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[0049] The alternative host-site interfaces with an alternative 
chent tier, which includes a number of cUent computers. The 
alternative host site includes an altemative apphcation tier and an 
alternative web tier. The altemative apphcation tier, which is 
5 similar to apphcation tier 15 of host site API, includes an 
altemative apphcation processing function, an altemative 
apphcation-tier resource monitor, and altemative resoiu-ce 
parameters. However, the altemative web tier of the altemative 
host site is quite distinct from the parallel arrangement of web tier 
10 17 of host site API. 

[0050] Altemative web tier 417 comprises six servers. Four of 
these servers are dedicated to specific functions that were shared by 
all servers in web tier 17 of host site API. One of the other two 
altemative servers provides additional processing for the 
1 5 encryption/decryption function. The other altemative server is a 
queue manager that adds a function of queuing chent and 
apphcation requests for serial routing by the altemative admissions 
control gateway. 

[0051] The altemative admissions control gateway performs 
20 the function of admission control gateway 23 and the load- 
balancing function of load balancer 21 of FIG. 2. It essentially acts 
as the load-balancing hub for the altemative web tier. It determines 
whether encryption or decryption is required and directs messages 
to one of the altemative encryption/decryption servers, as required. 
25 It implements deferral pohcy as directed by the altemative deferral 
manager. It directs appropriate chent requests to chent-request 
processor. Chent and apphcation requests are referred to the queue 
manager for ordered processing. The one function that is not 
routed through the altemative gateway is the resource monitoring 

15 
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function, in which resource utiUzation data is transmitted directly 
to the alternative deferral manager server . 

[0052] An advantage to the v^eb-tier hub approach of the 
alternative host site is that scaling can be done function by function, 
5 as it has been v^th encryption and decryption. Functions that are 
not heavily utihzed need not be expanded to new servers. On the 
other hand, functions can be combined on a server to avoid 
imderutihzation of hardware, 

[0053] The present invention has appUcabiUty to network 
1 0 communications and e-commerce. The invention is scaleable to any 
niunber of servers and provides for a wide variety of tier 
architectures with varying numbers of subtiers. These and other 
modifications to and variations upon the present invention are 
provided for by the present invention, the scope of which is defined 
1 5 by the following claims. 

[0054] What Is Claimed Is : 
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